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94. (Amended) The nucleic acid of claim 57, wherein the [functional] additional domain is 
heterologous with respect to the two nucleic acid-binding domains. 

96. (Amended) The method of claim 76, wherein the [functional] additional domain is 
heterologous with respect to the two nucleic acid-binding domains. 



Claims 40-70 and 71-88 are pending. Claim 40 has been amended to correct for 
typographical errors. Claims 57, 58, 63, 65, 76, 77, 94 and 96 have been amended for improved 
clarity and to be in better condition for appeal. No new matter has been added. 

Cancellation and or amendment of claims should in no way be construed as an acquiescence 
to any of the Examiner's rejections. The cancellation and/or amendments to the claims are being 
made solely to expedite prosecution of the present application. Applicants reserve the option to 
further prosecute the same or similar claims in the instant or in a subsequent patent application. 

Rejection of claims 40-70. 72. 89-92, 94-95. and 97 under 37 C.F.R. § 103 (a) in view of Park et 
aL. Mitchell et al.* Harrison and Schultz 

Claims 40-70. 72. 89-92, 94-95, and 97 have been rejected under 37 C.F.R. § 103 (a) as being 
unpatentable over Park et al. {PSAS 89: 9094 (1992)), in view of Mitchell et al. {Science 245:371 
(1989)), Harrison (Nature 353:715 (1991)), and Schultz (Nature 240: 426 (1988)). Applicants 
respectfully traverse this rejection. 

Claim 40, and claims 41-65 and 72 dependent therefrom, are drawn to a nucleic acid 
encoding a chimeric protein which binds to a nucleic acid comprising a composite binding site, the 
chimeric protein comprising two nucleic acid-binding domains, each of which binds a sequence 
which is a portion of the composite binding site, and wherein only one of the two nucleic 
acid-binding domains includes a zinc finger motif. Claim 66, and claims 67-70 dependent 
therefrom, are drawn to a nucleic acid encoding a chimeric protein which binds to a nucleic acid 
comprising a composite binding site, the chimeric protein comprising two nucleic acid-binding 
domains, each of which binds a sequence which is a portion of the composite binding site, and 
wherein only one of the tw o nucleic acid-binding domains is a nucleic acid-binding domain from 
a homeodomain containing protein. 



Remarks 
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Park et al. is relied on by the Examiner as teaching "a general strategy for designing proteins 
to recognize specific DNA-bindmg sites" and that "[t]his technique creates a protein that recognizes 
the composite site (page ^094, column 1)." The Examiner indicates that Park et al. "do not teach to 
specifically use the DNA-binding domains from distinct families of nucleic acid binding domains, 
use of specific types of domains such as zinc-finger domains." 

Mitchell et al. is relied on by the Examiner as teaching that "different DNA binding 
transcription factors are composed of a surprising variety of usually separable DNA binding and 
transcriptional activation domains (page 372, column 2)." 

Harrison is relied on as teaching that "many DNA-binding proteins recognize specific sites 
through small, discrete domains and that these domains can be interchanged between proteins, 
showing that these domains are independent folded units." 

Schultz is relied on as teaching that "enzymes can be created by adding or replacing entire 
binding or catalytic domains to generate hybrid enzymes with novel specificities" and that 
"[selective fusion of nucleic acid-specific binding domains may produce sequence-specific DNA 
or RNA cleaving enzymes (page 431, column 1)." 

It is the Examiner's position that 

[i]t would have been obvious to one of skill in the art at the time the invention was 
made to use the various DNA binding domains, activation domains, and cleavage 
domains taught by Mitchell et al., Harrison, and Schulz in the general strategy for 
designing proteins to recognize specific DNA-binding sites taught by Park et al. 
because Park et al. teach that it is within the ordinary skill in the art to stitch the DNA 
binding domains together from any proteins that recognize a specific DNA sequence 
by binding along the major groove, to recognize a composite site and Mitchell et al., 
Harrison, and Schultz teach such domains that can be functionally separated and 
recombined with other domains. 

Applicants submit that the cited references alone, or together with the general knowledge in 
the art at the time the invention was filed, fail to provide neither sufficient motivation to combine 
them to obtain the claimed nucleic acids nor the requisite reasonable expectation of success. 

However, applicants did not simply infer, as asserted in the Office Action, that "because one 
of the references in the rejection did not teach the whole invention, there would not have been a 
reasonable expectation of success." On the contrary, Applicants' noted that Park et al. fail to provide 
any motivation to combine the references and to provide a reasonable expectation of success, and 
that none of the secondary references cited cured this defect. 
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The heart of the Examiner's position appears to be that Park et ah alone "provides very strong 
motivation to make chimeric DNA-binding proteins that bind to composite sequences by fusing two 
previously separate D\A binding domains together." Applicants disagree. 

Applicants respectfully submit that the Examiner's conclusion requires one to read the 
excerpted Park et al sentences out of context. In view of the actual disclosure of the Park et al 
reference, a person of skill in the art would understand the general statements made in Park et al. to 
refer to arms of dimeric DNA binding proteins, and would not give broader meaning to statements 
in that reference that could otherwise be given broader interpretation, i.e., if taken out of that context. 

There are several lines of evidence that even Park et al. did not intend the general statements 
in their paper to be given the broader interpretation reflected in the office action. 

First, Park et al.'s later published paper (PNAS 90: 4892 (1993); copy of which is enclosed 
herein as Exhibit A), defines the concept of protein stitchery, referring to their '92 paper, as 
representing that "individual basic amis (half sites) of the dimer and the individual half sites of the 
DNA can be recombined or stitched together in various sequences to form new proteins selective for 
binding to the new DNA sites" (emphasis added; see paragraph bridging pages 4892 and 4893). 
Thus, Park et al. defined the general concept of protein stitchery as combining "arms" of proteins 
which normally form dimers. 

Second, if Park et al truly intended the generic meaning represented by the Examiner, they 
wouldn't have limited themselves to proteins which bind along the major groove of the DNA. 
Clearly there are DNA binding proteins which bind along the minor groove and indeed 
homeodomains and zinc finger domains, such as are used to illustrate the subject invention, do make 
minor groove contacts. 

In addition, the publication of applicants' work in Science, as discussed further below, belies 
acceptance in the art of broader conclusions from the Park et al reference. 

Accordingly, Applicants again submit that the general statements made by Park et al. would 
not have been viewed, and should not now be viewed, as being as broad as the Examiner contends, 
and would not provide the requisite motivation to make the claimed chimeric proteins. 

Moreover, as Applicants explained before, Parks et al. cross-linked together the DNA binding 
domains of two proteins which normally bind DNA only in the form of a homodimer. However, 
Park et al. does not teach or suggest that a chimeric protein having a composite DNA binding 
domain consisting of tw o or more DNA binding domains from different types of DNA binding 
proteins, which do not normally interact with each other, would bind DNA with higher affinity to 
the composite binding site than to portions of it. Harrison and Mitchell et al. merely teach that DNA 
binding domains can be separated from the rest of DNA binding proteins. However, these teachings 
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do not fail to cure the defect of Park et al. Furthermore, neither Harrison nor Mitchell refer to 
portions of DNA binding domains, e.g, individual zinc finger domains, which can also constitute part 
of Applicants' claimed composite DXA binding domain. 

The Examiner also states that M [a]bsent evidence to the contrary, there would have been a 
reasonable expectation of success that the domains taught by Mitchell et al. and Harrison could be 
combined with each other to create a protein that recognizes a composite binding site as taught by 
Park et al." How ev er, as set forth above, Mitchell and Harrison merely teach that DNA binding 
domains can be separated from the rest of DNA binding proteins. There was no reasonable 
expectation of success that a chimeric protein containing a composite DNA binding domain 
consisting of DNA binding domains from proteins which are unrelated, as claimed by Applicants, 
would bind to a composite DNA binding site with higher affinity than to each of the half sites to 
which each of the DNA binding domains of the composite DNA binding domain bind. Nor was 
there any reasonable expectation of success that binding of such a chimeric protein containing a 
transcriptional activation domain to a DNA binding site would be able to stimulate transcription of 
a target gene operably linked to the DNA binding site, as Applicants showed. 

Further support that there was no reasonable expectation of success to obtain the claimed 
invention is provided by a statements made by one of the inventors of the instant application after 
publication of their invention in the journal Science: "laboratory tests have proved the artificial 
switch can find, and control, a single gene among the 80,000 that exits in humans." The article 
(attached hereto as Exhibit B) also quotes Carl Pabo, as stating "the critical thing was showing it can 
bind the proper site." Furthermore, if there had been a reasonably expectation of success, Applicants' 
description of the invention would not have been published in the prestigious peer-reviewed Science 
journal (Pomerantz et al. (1995) Science 267: 93, attached hereto as Exhibit C). 

With regard to making a nucleic acid and vector comprising the nucleic acid which encodes 
the chimeric protein, the Examiner states that "it would have been obvious to do so because Parks 
et al. teach that a continuous approximately 70 amino-acid protein that should recognize a 
predictable site can be made, instead of using a cysteine linker, and thus it would have been obvious 
to make a nucleic acid that encodes this protein and place the nucleic acid in a vector to express the 
protein, because such a way of making a mutated, recombinant protein is and was well known in the 
art." Applicants respectfully traverse this statement. Although Parks et al. may make general 
statements, i.e., speculate, that the cross-linking between the two monomer DNA binding domains 
(presumably of a dimeric DN A binding protein) could be replaced by a peptide bond, a person of 
skill in the art at the time the invention was made would have known that a peptidic bond may give 
rise to a protein structure that is different from that resulting from disulfide cross-linking. Thus, 
there was no reasonable expectation of success that two different DNA binding domains linked 
together through a peptidic bond would form a chimeric DNA binding protein. 
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Thus, in view of all of the above. Applicants respectfully request that the Examiner 
reconsider and withdraw rejection of claims 40-72 under 37 C.F.R. § 103 (a) as being unpatentable 
over Park et al., in view of Mitchell et aL Harrison, and Schultz. 

Rejection of claims 40-70 and "2-88 under 3 7 C.F.R. § 103 fa) in view of Park et aL Mitchell et 
aL* Harrison. Schultz and Gossen et al. 

Claims 40-70 and 72-SS have been rejected under 37 C.F.R. § 103 (a) as being unpatentable 
over Park et al. (supra), in view of Mitchell et al. (supra), Harrison (supra), Schultz (supra) as applied 
to claims 40-70, 72. S9-92, ( M-05. and 97 above, and further in view of Gossen et al. (U.S. Patent 
No. 5, 464,758). Applicants respectfully traverse this rejection. 

Claim 40 and claims 41-65, and 72-74 dependent therefrom are drawn to a nucleic acid 
encoding a chimeric protein which binds to a nucleic acid comprising a composite binding site, the 
chimeric protein comprises two nucleic acid-binding domains, each of which binds a sequence which 
is a portion of the composite binding site, and w herein only one of the two nucleic acid-binding 
domains includes a zinc finger motif. Claim 66 and claims 67-70 dependent therefrom are drawn 
to a nucleic acid encoding a chimeric protein which binds to a nucleic acid comprising a composite 
binding site, the chimeric protein comprises two nucleic acid-binding domains, each of which binds 
a sequence which is a portion of the composite binding site, and wherein only one of the two nucleic 
acid-binding domains is a nucleic acid-binding domain from a homeodomain containing protein. 
Claims 75-83 and 84-89 are drawn to a method for modulating expression of a gene in a cell, 
comprising modulating the level of a chimeric protein in a cell which includes a gene operably to 
a composite binding site to which the chimeric protein binds, wherein the chimeric protein comprises 
two nucleic acid-binding domains, each of which binds a sequence which is a portion of the 
composite binding site 

Park et al., Mitchell et al., Harrison and Schultz are relied upon by the Examiner as disclosing 
what is summarized in the previous section. Gossen et al. is relied upon by the Examiner as teaching 
"a nucleotide molecule coding for a chimeric transactivator fusion protein comprising a DNA 
binding domain (tet repressor binding domain) and a transactivation domain (such as VP16 of 
HSV). M 

It is the Examiner s opinion that "[i]t would have been obvious to one of ordinary skill in the 
art at the time the invention was made to form a transcriptional regulatory system from the DNA 
encoding a chimeric transactivation protein made obvious by the teachings of Park et al. (AW2), 
Mitchell et al. (S), Harrison (T) and Schultz (U), using the method taught by Gossen et al. because 
Gossen et al. teach that it is within the ordinary skill in the art to make a nucleic acid vector that 
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encodes a chimeric transactivator fusion protein (under the control of a promoter active in eukaryotic 
cells), make a nucleic acid encoding a heterologous protein operably linked to a regulator binding 
site that the chimeric protein binds to..." 

Applicants respectfully submit that, as set forth above, Park et al. (AW2), Mitchell et al. (S), 
Harrison (T) and Scluiltz (L*) do not make obvious a chimeric transactivation protein. Thus, even 
if, as contended by the Examiner. Gossen et al. teach that "it is within the skill in the art to make a 
nucleic acid vector that encodes a chimeric transactivator fusion protein (...), make a nucleic acid 
encoding a heterologous protein operably linked to a regulator binding site that the chimeric proteins 
binds to, place the nucleic acid in a eukaryotic cell..." the cited references fail to provide any 
motivation to combine the references and to provide a reasonable expectation of success. 

Thus, in view of all of the above, Applicants respectfully request that the Examiner 
reconsider and withdraw the rejection of claims 40-70 and 72-88 as being unpatentable over Park 
et al., in view of Mitchell et al., Harrison, Schultz and further in view of Gossen et al. 



In view of the above remarks and the amendments to the claims, it is believed that this 
application is in condition for allow ance. If a telephone conversation with Applicant's Attorney 
would expedite prosecution of the above-identified application, the Examiner is urged to call the 
undersigned at (617) 832-1000. 



Conclusion 



Respectfully submitted, 
FOLEY, HOAG & ELIOT LLP 



By: 



One Post Office Square 
Boston, MA 02109 
Telephone: (617) 832-1000 
Facsimile: (617) 832-7000 



Matthew P. Vincent. Ph.D. 
Registration No. 36,709 
Attorney for Applicants 



Date: February 4. 2000 
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Vol". 90, pp. 4892-4896. June 1993 
Biochemistry 



Design superiority of palindromic DNA sites for site-specific 
recognition of proteins: Tests using protein stitchery 

Changmoon Park**, Judy L. Campbell^, and William A. Goddard IU*t§ 

•Materials and Molecular Simulation Center, Beckman Institute (139-74), 1 Divisioo of Chemistry and Chemical Engineering, and *Division of Biology, 
California Institute of Technology, Pasadena, CA 91125 

Contributed by William A. Goddard III, February 12, 1993 



ABSTRACT Using protein stitchery with appropriate at- 
tachment of cysteines linking to either C or N termini of the 
basic region of the v-Jun leucine zipper gene-regulatory pro- 
tein, we constructed three dimers — pCC, pCN, and pNN. All 
three bind specifically to the appropriately rearranged DNA 
recognition sites for v-Jun: ATGAcgTCAT, ATGAcgATGA, 
and TCATcgTCAT, respectively (K 4f ~4 nM at 4°C). Results 
of DNase I footprinting provide strong support for bent rec- 
ognition helices in leucine zipper protein-DNA complexes. 
Comparison of the results for pCC and pNN with those for pCN 
shows the design superiority of palindromic sequences for 
protein recognition. 

The mechanism by which cells respond to external stimuli is 
a fundamental problem in modern biology. Transcriptional 
regulatory proteins are known to play a key role in several 
systems evolved by cells to convert extracellular signals into 
altered gene expression (1). They operate by specifically 
binding to DNA target sites, which regulate the transcription 
of particular genes. Prominent among transcriptional regu- 
latory proteins arc the leucine zipper family of proteins, 
which recognize the DNA binding site as either homodimers 
or heterodimers (2-4). 

The leucine zipper proteins are characterized by two 
functional segments: (/) the leucine zipper region, a helical 
region containing four or five leucines spaced at seven-amino 
acid intervals, and (if) the basic region containing many basic 
residues (5-10). The basic region appears to be unfolded in 
solution but assumes an o-helical structure binding to its 
recognition site (11-13). Site-directed mutagenesis (6, 7) and 
domain swapping (8-10) experiments show that the leucine 
zipper region mediates dimerization and that the basic region 
is responsible for DNA binding. Experiments replacing the 
leucine zipper region with a three-peptide linker (14, 15) 
showed that the dimerized basic region recognizes the same 
site as the native protein, supporting the scissors grip model 
(5), where each monomer recognizes the half site of the 
symmetrical DNA binding site. Recently, we showed that the 
normal dimer (denoted pCQ, which selectively recognizes 
the sequence ATGAcgTCAT, can be inverted to form a 
protein (denoted pNN) that selectively recognizes the in- 
verted site, TCATcgATGA (15). 

Gel electrophoresis experiments (22) with Jun homodimer 
and with Jun-Fos heterodimer showed that Jun and Fos 
induce DNA bending in the opposite direction upon binding 
to the specific site. To explain this, it was proposed that the 
basic region of jun has a bent a-helix, while the basic region 
of Fos has a straight helix. However, a recent x-ray crystal 
structure (21) for the complex between GCN4 and DNA 
containing the GRE site (ATGACTCAT) showed a straight 
single a-helix for the basic region of GCN4. Our current 

The publication costs of this artjele were defrayed in pan by page charce 
payment. This article must therefore be hereby marked "advertisemmi" 
in accordance with 18 US C. §1734 solely to indicate this fact. 



Peptide* (Aaino end at l*ft) 

T-Jun-br: 5 QZRXXAERJCR KRKRXJULSX^ KXMCLERXAR 

Jun-N : CCC S QERXKAZRKR KXKRIAJLSXS RXFJCLERXAR 
v-Jun-C : S QERDCJUERKR MKKRIAASK3 KKWCLXRIAR CGC 



Oligonucleotide* 

»'-«c*9*tccOT«tcctJ9gtt*MC9XT<^9TCAXc9qrt»t*79to9*9**cccg9*Crt-j' 
' J '^*9cct*99cct*w«tcc«ttt9ctAi3gcAjCIAflcc«t*tcc*7ctctt««gcct*9«-s • 

qj . S *-ctc^»t<xg7*tcctM7tt**«c«>XTCXC9AJ^Xc97t»t*7qt C9«7*«ttcg7atct-l ' 
i • -q a <jz ct *<j <jc ct * 79 • t ec*a 1 1 1 gcTACTgc IACTgcc*t* cc c*t« ct: t n^cct *<j* - j • 

VH . v-ctc^^tcc9g*tcct»^t*AJC^^XTc9XTGAcq 9 t»t*wc<j*9««cco99*tct-i- 
l • gx ct * weet *<j 9 * t c c«* 1 1 1 ?cAGTAgcTACT gc c« ti t c c*<jc t c 1 1 u qccx *g « - j ■ 



Disulfide bond, formation to mmXi 



• pCC (or pHNJ and pCN 




Fig. 1. Sequences of protein (a) and oligonucleotides (b) used in 
gel-retardation and footprinting studies. Total length of each oligo- 
nucleotide is 62. v-Jun-br contains the basic region of v-Jun. CGG is 
added to the N terminus of v-Jun-br to make v-Jun -N and GGC is 
added to the C terminus of v-Jun-br to make v-Jun-C, Proteins were 
chemically synthesized and checked by mass spectroscopy at the 
Biopoiymer Synthesis Center ax the California Institute of Technol- 
ogy (15). (c) Strategy for making pCC (and pNN) and pCN dimers. 
v-Jun-C was incubated with 10 mM dithiothreitol (DTT) for 5 hr at 
rooni temperature and purified directly into 10 mM 2,2'- 
cUthicxlipyridine/100 mM sodium phosphate, pH 5.5, <xra taming 30% 
acetomtrQe, Resulting thiopyridyi-(v-Jun-C) was purified by HPLC. 
Purified monomer v-Jun-N underwent reaction with 2 equivalents of 
thiopyridyl-(v-Jun-C) in solution containing 100 mM tetraethylam- 
monium acetate buffer (pH IS) and 15% aeetonhnle for 12 hr at room 
temperature. The final product, pCN, was purified by HPLC (15). 

results support the interpretation that the v-Jun homodimer 
bound to its specific site has bent a-heiices. 

Peptide Design 

Using protein stitchery, we have made three kinds of v-Jun 
(16, 17) homodimers (denoted pCC t pNN, and pCN) and 
show here that each selectively recognizes the appropriately 
reorganized DNA binding sites ATGAcgTCAT, TCATc- 
gATGA, and ATGAcgATGA (see Fig. 1). The concept of 
protein stitchery (15) is that the individual basic arms (half 

$To whom reprint requests should be addressed at *. 
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Probe-DNA I CC 1 CN 1 NN 1 

Peptide - pCCpCNpNN - pCCpCNpNN - pCCpCNpNN 




Probe-DNA 



CC 



-1 t- 



CN 



NN 



:op i — Bottom — l r 

zCC 1 I" 



■Top i — Eottorn — t r 

pCN 1 t~ 



Top- 



■ Bottorr- 



• pNN • 



Peptide |A/G h ia/G — H~ I IA/G — + 



- -I- 1 IA/G - -r IA/G 




*L aibuL at 50 ^^^^^SSSS^SS^ 3£ ™ «^ *s? rr, d isrS 



s^un^ 

in a 1M reaction volume. After 5000 cpm of each 5 "F-WkW ' P™^ A ^ determined by titration of the gel shift. - 2 nM for 
SrecOy on an 8% nondeoaturing polvacryiamide gel * Jiu^TA b^er_at4 ^JV ^ ^ dde t^ds specifically to iU proposed 
^CC *£■ 6 nM for pCN/CN , and - 4 nM for pKN/NN . all « ^^^^Xd^^DN A containing the predicted binding 
, and not to theother sites. (6) DNase I footprints ^ . 0 ^' VzTconuL torine serum albumin at 50 ws/ml, 5% (vol/voT) 

I SoVrCC pCN. and pNN, respectively. FootprmUng assay solu^fm 50^) wn^nswmn DNA, and the appropriate 

* Dcotideat 50 nM. This solution was stored at 4*C for 40 nun. AtteO ^ . f DNasc x stop solution contauung 15 mM EDTA (pH 

t Stored 1 min more at 4«C. DNase I digestion was stopped by phcnol^orofonn^tracted. ethanoh^dprtated .and 

V gXlOOmM NaO. and sonicated salmon sperm DNA at 40 «M- ^^^S Uer, denatured at 90X for 4 nfe. and analyzed 

: washed with 70% (vol/vol) ethanol. The pellet was resuspended m ^^^JS^edBcdly binds to the proposed bmtag «te and 

: on 10% polyacryUmidc sequencing gel <^ 

aT^it^^ 



sites) of the dimer and the individual half sites of the DNA can 
be recombined or stitched together in various sequences to 
form new proteins selective for binding to the new DNA sues. 
Thus, we use here the recognition helix v-Jun-br of Fig. la 
with a cysteine linker at either the N (v-Jun-N) or the C 
(v-Jun-C) terminus. These can be combined to form either 
pNN pCC, or pCN dimers as illustrated in Fig. lc. Forma- 
tion of pNN and pCC (via pathway I) is straightforward since 
each involves dimerization of identical monomers. To ensure 
formation of pCN, the cysteine at the C terminus of v-Jun-C 
was reacted with excess 2,2'-diihiodipyri<iine to form thiopy- 
ridyl-(v-Jun-C) (18, 19) and then coupled with the cysteine at 
the N terminus of v-Jun-N to form the pCN dimer (v-Jun- 



Q-S-S-<v-Jun-N) (pathway III; Fig. lc). We also verified 
pathway II for forming pCC. 

Results and Discussion 

We carried out eel-retardation assays (15) for each of the 
three peptide dimers with oligonucleotides (Fig. lb) corre- 
sponding to each of the three proposed binding sites. These 

results (Fig 2a) show that each dimer recognizes the appro- 
priate binding site specifically with no detectable binding to 
the other sites. It is important to note that this strong 
preference for dimer occurs even though all oligonucleotides 
contain proper sites for binding a single arm of each dimer. 
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Fig. 3 . Schematic diagram for the complex between peptides and 
their corresponding DNA sites assuming a bent recognition helix 
(a~c) and a straight recognition helix {d-f). (a and d) Complex 
between pCC and probe DNA CC. (b and c) Complex between pCN 
and probe DNA CN. (c and f) Complex between pNN and probe 
DNA NN. The linker connecting two monomers indicates a disulfide 
bond between the cysteines attached to the end of peptides. In each 
case, the side view is on the top and the top view is on the bottom.- 
Outer and inner circles of the top view represent the outer and inner 
major groove surfaces of the top strand for the proposed binding site 
projected onto an imaginary plane perpendicular to the axis of DNA 
and running through the center of the peptide and binding site. 
Shading is used with the peptide and DNA contacts to ease the 
tracking of these regions in different cases. This diagram shows that 
a bent recognition helix can contact the same 4 bases for all three 
peptide dimers, while a linear recognition helix would contact 
different bases in the three peptide dimers. (This diagram b not 
meant to imply an exact correlation between where the basic region 
is bent and where the bases are positioned.) ~- . - v 

Therefore, at 3 nM peptide coacentration the dimer does not 
make a stable complex with DNA unless both arms in the 
dimer recognize their proper sites, This implies cooperation 
between the monomers in recognizing the binding site (20). 
Since all three dimers have similar (2-6 nM) binding affinities 
with their own sites and since all three lead to the same length 
region protected from DNase I digestion (see below), we 
conclude that (/) all three cases involve similar conformations 
in the complex between DNA and peptide, and (U) the 
monomer arm retains the same contact region in various 
dimers; this occurs despite the changing orientation of the 
monomers in the various peptide dimers (15). 

There are two major models for the bound conformation of 
leucine zipper protein to the specific site. One is the induced 
helical fork model (13) t which proposes a straight single 
a-helix for the basic region, and the other is a scissors grip 
model (5) which proposes a bent a-helix for the basic region. 
The recent x-ray crystal structure (21) for the complex of 
GCN4 containing only the basic and leucine zipper region and 
DNA-containing GRE site showed that the basic region of 
each protein has a straight cr-hchx conformation recognizing 
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Fig. 4. Specific binding of protein at and near the corresponding 
DNA binding site, (a) Complex pCC/CC. (b) Complex pCN/CN. (c) 
Complex pNN/NN. O represents specific binding; X represents 
nonspecific binding. pCN/CN has one specific binding site and two 
sites for semispecific (half site) binding near its (noopalindromic) 
binding she. However, pCC/CC and pNN/NN do not allow semi- 
specific binding near their (palindromic) binding sites. - - - - y-~- 

each half site of the dimer binding site. There was no DNA 
bending caused by protein binding (21). However, there 
remain many problems with assuming that the basic region is 
in all cases a straight a-helix: (i) The bases flanking the active 
site affect the binding of leucine zipper protein even though 
the crystal structure shows no direct contacts with protein 
(21). (/f) Gel electrophoresis experiments using Jun ho- 
modimer and Jun-Fos heterodimer showed that Jun and Fos 
induce DNA bending in opposite directions upon binding to 
their site (22), whereas GCN4 does not induce DNA bending 
(21, 27). (Hi) Even though GCN4-br (a peptide containing the 
basic region of GCN4 protein) showed no specific binding 
(for details see ref. 14), we find that the monomer v-Jun-br (a 
peptide containing only the basic region of v-Jun; see Fig- la) 
specifically binds to the dimer site and shows the same 
protection as the dimer. Our conclusion then is that there is 
no universal model for the DNA-bound conformation of the 
basic region of leucine zipper proteins. Whether it is linear (as 
in GCN4) or bent (as in Jun) depends on the specific primary 
sequence and the properties of the solutions (stabilizers, pH, 
etc.) used in the experiments. 



Biochemistry: Park et al. 

The result that all three dimers (pCC. pNN. and pCN) bind 
~Z\v to the appropriate combination of oligonucleotide 
S Sri SsSi teheBcal binding arm is bent (5 22) (see Fig 
' X argument is as follows. The optimum bmdmg s.te for 
ofcJun tomodimer and the Jun-Fos heteroduner is known to 
^ ATGAcTCAT or ATGAcgTCAT, where the mner 7 0^8 
les play an important role in recognition (23. 24). The x-ray 
^stal structure of GCN4 bound to DNA leads to stratght 
Tehees whfch have direct contacts with only the inner 7 
c the GRE site (ATGACTCAT). Thus, each arm 
Tco n£ the hS-s £ (gATGAc or gTCATc) of the dimer 
£?ZL asvmmetricaUy . If the same were true for v-Jun 
SK^SSS^ are ^^^^ 

f^cZZ £ SS. then the orientations of the bmduig 
^would have very different orientations (Fig. 3 e and/), 
^should result in different protection from DNase I diges- 
S(nSS,S«S I" addition, for the pNN/NN complex^ 
52" would lead to N termini of the two armstoo distant to be 
Scted b?the added linker, GGCCGG. The aUemauve to 
fie 3 "/is for each dimer to have the same angle (as in F.g 
25' mis the actual contact region would not be equivalen 
n the three cases and it would be difficult to explain the ge 
Nation and footprint^ results. Thus. we 
for v-Jun the basic region becomes bent upon bmding to the 

D On the other hand, with the recognition helix bent roughly 
at the middle of the helix (as indicated in Fig. \^\^ 
nlausible that the contact reg.ons are ATGAcgTCAl tor 
tec TCATcrATG A for pNN. and ATG AceATGA for pCN 
SlS^o equivalent contact regions in all three cases and 
to the roughly equivalent binding energies apparent u Fig. 
£ ta 3dWon.7ootprinting (15) of the three pept.de doners 
f£g. 2b), each with the appropriate ohgonucleot.de dun«, 
suSests that the complcxed peptide dimers protect the fuU 
pSS site (all 10 bp) from DNase . I ^gesUon -The* result 
strondv support the bent recognition helix model for the 
basic Reconsidered here and hence also for the leucne 

^ffS^^Kfootprinting (Fig. 2b) shows 
Complete protection on the binding site and partial protec- 
do C n° cabases flanking the binding ; she wherea, for 
dCC/CC and pNN/NN this does not happen. This occurs 
even though gel-retardation assays indicate specific binding 

or aU Sexes. Our explanation of this (Fig 4) suggests 
why palindromic sequences are so common for selective 
2di£tf regulatory proteins (25, 26). This reasoning ; is 
supported by decent results we have observed showing that 
STe monomer of v-Jun containing only the basic region 

v-Jun-br) specifically protects both pCC and .pNN bmdmg 

for the DNA probe carrying a sequence of cgATUAcgi 
S^gT^TeT(containii pCC and pCN binding sites over- 
haJoflach dimer binding site in die cente than £r 
CC (and CN) probe DNA. These results imply that the half 
Sg?CA?c^ 

binding site interferes with the binding of pCC (or pCNT c the 
dimer binding site (because the half site can be used a^ , * 
binding site for each arm of the dimer if the orientation 
"en the site and arm fits). Details of these result wdl be 
^Wisned elsewhere Fig. 4 indicates the strength of binding 
KtSeep^ride dimers at or near their DN A . recogmt.on 
sites. Here. O represents good bmdmg. while X represents 
nonspecific binding. The palindromic s.tes for pNN and pCC 
lead to binding only when the protein ,s exactly a the 
recognition site, whereas pCN can recoemze both full site 
(both arms bound) and half sites (one arm bound). In gel 
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retardation and DNase I footprinting. semispecific binding 
competes with specific binding. This occur, .because one arm 
of the semispecificaUy bound peptide would cover hatf of he 
^ne^c binding site, preventing another dimer from binding 
^ provSi g fuU protection. This explains (0 why gel- 
afsays fFig. 2a) show fewer ^gg^ 
the pCN/CN complex compared to the pCC/CX ana 
pNN/NN complexes and («0 why footprintmg assays (F g. 
2 W show incomplete protection on the bindmg site and parual 
orotection on a few bases flanking the bmdmg site Such 
Lm spSic binding interferes with the *e-,pecuTc bmdmg 
and would eventually result in low production and abnor- 
mally slow growth. However, gel retardation shows no 
detctaoSe nonspecific or semispecific l^^g* 
concentration, indicating that semispecific bmdmg is s.gnrf 
Stly weaker than specific binding. After (hmenzauon. the 
Steins Stable for palindromic dimer binding sites avoid 
Specific DNA binding, leading to more selective : recog- 
nSof the specific sites. Thus, palindromic dimer bmdmg 
sites provide a good design for selective molecular recogni- 
tion JSdfor further flexibility the link can ahgn sites (Fig- 3) 

^^LroTiTthree dimers considered here provide 
en^rage^ 

or deigning and synthesizing proteins to iceognu long 
DNA sequences. Thus, for trimers to recognue 15-bp se 

ucne V are using an approach similar to that of Fig 1c 
involving appropriate use of cysteine ^ ka 8« M< ^f' r 
acuvaton. It seems possible to design proteins for 20 bp and 

10 Emmary, we find the Mowing^) ^ic^c^ of 
v-Jun leads to three dimers (pCC. pNN. and r^.each ot 
wS binds specifically to the ^P™" ^ 0 
DNA sites Thus, there is cooperation between the two 

the relative orientation of two monomers u ' the duner 00 
-These results provide strong 
model of the basic region when bound to DN A. (m) lhese 
Sts provide an explanation for the advantage 1 of tor; 
iration and the use of palindromic sites in the site-selective 
Sdi of pSteins to DNA. (/v) These re smts show protein 
sutcherv to be useful for establishing the conformation and 
mecSsmfSbindingofprotems^ 
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SECTION: NEWS; Pg . A17 
LENGTH: 260 words 

HEADLINE : Scientists Flip on Gene 'Switch' 
BYLINE: By Robert Cooke. STAFF WRITER 
BODY: 

Custom-made molecular switches that can turn genes on or off on cue are now 
being created in the laboratory, scientists said yesterday, showing promise as a 
powerful new way to attack genetic diseases, cancer and perhaps even AIDS. 

The new approach is analogous to making an artificial key to start a stalled 
car. By finding a way to turn on a dormant gene that makes normal hemoglobin, 
for example, it may become possible to rescue people with sickle cell anemia and 
similar blood disorders. 

The same technique may also be used to block gene activity, stopping the 
wildfire growth of cancer cells, or to shut off a virus' genes to abort an 
infection. An obvious target would be HIV, which causes AIDS. 

According to biologist Carl Pabo, at the Massachusetts Institute of 
Technology, laboratory tests have pr oved the artificial switch can find and 
control a single gene among the 80,000 that exist in humans. The protein 
molecule they created attached itself at exactly the right place among 3 billion 
links in the long, complex chain of human DNA. 

"The critical thing was showing it can find the proper site," Pabo said. But, 
he cautioned, much work must yet be done before any therapy for human diseases 
becomes possible. 

In the tests, the switch worked as hoped, activating the desired gene. It 
represents a "fundamental advance and a very powerful method, " Pabo said. 

A full report on the experiment was published in Science yesterday. Other 
members of the team were Joel Pomerantz, a graduate student, and biologist 
Philip Sharp, both at MIT. 

LANGUAGE : ENGLISH 
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Structure-Based Design of Transcription Factors 

Joel L. Pomerantz, Phillip A. Sharp, Carl O. Pabo 

Computer modeling suggested that transcription factors with novel sequence specificities 
could be designed by combining known DNA binding domains. This structure-based 
strategy was tested by construction of a fusion protein, ZFHD1 , that contained zinc fingers 
1 and 2 from Zif268, a short polypeptide linker, and the homeodomain from Oct-1. The 
fusion protein bound optimally to a sequence containing adjacent homeodomain (TA- 
ATTA) and zinc finger (NGGGNG) subsites. When fused to an activation domain, ZFHD1 
regulated promoter activity in vivo in a sequence-specific manner. Analysis of known 
protein-DNA complexes suggests that many other DNA binding proteins could be de- 
signed in a similar fashion. 



Transcription factors arc critical re^ulatoro 
ot £ene expression. The rational design ot 
transcription factors with novel DNA bind- 
ing specificities and regulatory activities 
will provide reagents tor both biological 
research and gene therapv. The recent de- 
termination of a series ot structures of pro- 
tein-DNA complexes ha- facilitated a de- 
sign strategy that uses computer modeling 
to predict how DNA binding domain^ 
could he combined to generate novel spec- 
ificities. We explored thi> strategy by de- 
signing and testing a zinc tinger- homeodo- 
main fusion protein. 

Computer modeling studies were used to 
visualize how zinc tinger- might he fused to 
the Oct-1 homeodomain The known crys- 
tal structures of the Ztf:6S-DNA (J) and 
Oct-1 -DNA (2) complexe- were aligned bv 
supenmposition of the double helices in 
several different registers. Two arrange- 
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ments were particularly interesting. In one 
alignment, the COOH-terminnl end ot :uk 
tinker 2 was 8.8 A away from the N'lD- 
terminal arm of the homeodomain (Fig. W 
which suggested that a short polypeptide 
linker could connect these domains. In thi- 
model, the fusion protein would bind a 
hybrid DNA site with the sequence 5'- 
AAATNNTGGGCG-3'. The Oct-1 ho- 
meodomain would recognize the A A AT 
subsite, zinc finger 2 would recognize the 
TGG subsite, and zinc finger 1 would rec- 
ognize the GCG subsite. There was no pos- 
sibility for stenc interference between the 
zinc fingers and the homcoxlomain in tho 
arrangement. Superimposition of the DNA 
duplexes in other registers generated a sec- 
ond plausible arrangement tor a hybrid pro- 
tein (3); however, this model wa^ not a^ 
favorable because there was a ri^k of steru 
interference between the zinc fnvjer- and 
the homeodomain. 

The design strategy w.i- tested by con- 
struction ot a tuoon protein. ZF11D1. that 
contained fmger> 1 and 2 of Zif 26 s , a do 
idv-anoanj linker, and the Oct-1 hoineod- *- 
mam (Fig. 2A). A glutathione-S-t 1 atotcr im 
UiST) domain u,b added tt - facilitate e.\- 
pre>s|on and purification, and the D v sA 
binding aUi\ itv of tho tu-ion protein u a- 
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dctcrmino.1 K -cUxthui hin J :i ^ Mto 
ti.-tii a r.tiiJ.'in y-.--.-l 'i iuv lc -i ;Jo. .Af- 

ter Kuir muikK ^'f >eL*cru>n. In >ito were 
clone J .in J .sequenced < Fil; 2B). l.\unpan- 
suti v it lIk-^c Sequence^ rc\\c*lcJ the con.>cn- 
su> HnaW >ire S'-TAATTANGGGNd- V 
(Fim 20. The 5' halt ot thi> conscnMh, 
TAATTA, roeinhleJ a canonical home- 
o^lornain hmdinc >ite (TAATN.O [4) and 
marched rhe >ire iTAATNA) tha: is pre- 
ferred bv the Oa-1 honu-odotnain in the 
absence ot the rOU -specific domain (5). 
The V halt ot the consensus, KGGGNG, 
resembled adjacent binding sires tor tinkers 
2 (TGG) and 1 [GCG) ot Zit'26S. The 
guanine residues were more tightly con- 
served than were the other posit u»ns in these 
:inc tinker subsites. and the crystal structure 
shows that these are the positions of the 
critical side-chain- base interactions (/). 

The ZFHP1 consensus sequence (5'-TA- 
ATTANGGGNG-V) matched the movie 1 
that appeared to be most structurally feasible 
(6), bur because ot the internal symmetry of 
the TAATTA subsite, this sequence was 
aUo consistent with the homeodomain bind- 
ing in another orientation (Fim 2D; compare 
mode 1 and mode 2). This jhernauve ar- 
rangement, in which the critical TAAT is 
oil the other strand and is directly juxtaposed 
with the :mc tinker ITCXjCjCU) subsites, 
was considered unlikely because modeling 
had snooted that this arrangement required 
a linker to span >20 A between the GOOH- 
terminus of fuvjer 2 and the NH , -terminus 




Fig. 1. Model of a z^c finger- homeodomain hy- 
brid. Fmge r 1 of Z'f2"53 is depicted in purple, fmge r 

2 in ye! and t r e Oct-1 homeodomain in red. 
The DNA is blue, w - the base pairs m the AAAT 
and TG3GCG cm vs h.g^ghted m 
mbmj i r cte.n reo;.cv ces a sequence r 
\ 'Am." ■ ' *N i C^Oo ^ _j ..." . Tne \j 



\f' 



c y am the 

f the form 
e ( Gi. ' 

■ m 



of rhe Iv Miickvb 'mam. To determine h^u rhe 
hoine^ »d- Mnain K>unJ to the TAATTA se- 
quence m the 5' halt ot the consensus, 
ZFH01 was tested for binding to probes 
( 5 ' -TAATG ATGGGCG- 3 ' and 5'-TCAT- 
TATGGGCG-3') designed to distinguish 



between these orientation^ ZfHPi b-nind 
m the S' -TAATG ATGGGCG -V pr.Tc 
with a disSvKTiation constant ot >.4 >■ 12 1 ' 
M and preferred this probe to the 5'-TCAT- 
T ATGGGCG 3' probe by a factor ot 3 3 
(Fit:- 3A; compare lanes 6 to 10 and 11 to 



A ZFHD1 

I 1 

Nhy | GST- finger 1 -finger 2 -homeo. \ - COOH 



... RTHTG GGRR RKKRT ... 

B 

GTTTGGCACCTGACTAATTTAAGGAG 

GCGTTAATTAAGGGAGGTAAGG CCC 
CTC GGCCGTT AATGAGGGGTGTTC G 

TAATTATGGGCGGGATC GAATAGCC 
GGCAATAATCAATCCTTTAATTATGG 

GGCCGTACCTCATGAAATTAGGGGCG 

GTTAATTATGGGGTAATAATGGTGC 
GTCGGGCTCTGTTAATTATGGGTGG 

GGATAATTACGGGTGGCATTTAGGC 
GATAAATAGGGGCGTCC CATCCC GT 
TAAATTAGGGCTTTAATTACGGTC 
TCATTAGAGTGTTAATGAGATGCGC 
TAGTTGCTAATTTGTATTAATTAAAG 

AGTTATTAATTAAGAATGTTAATTA 
GTGTGATAATGAGCTGGTCCGTCC C 
ATATTAAGGCGTAATTCGGACAAGA 
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Fig. 2. Selectman b. ZFHO* o* a 
hybrid bmdng s^te 'mm a rco:i c* 
random ohgonucleecdes (A Struc- 
ture of the fusion cmten used tc 
select binding s^tes ^9'- The ^~de r - 
Imed residues a r e ho— tne Zr26&- 
DNA (1) and Oct-1 -DNA [2, cr,sta: 
structures and cc^esoooc tc the 
termini used in the compute' mod- 
eling studies. The hnrer ccm.vns 
two glycine residues that /.em in- 
cluded for flexibility a- a tne tw: a r - 
ginine residues tha: a'e present a: 
positions -1 and "! cf the CcM 
homeodomain. (B. Sequences o* 
1 6 sites isolated a*te- fc ur rc^mos c* 
bmdmg site select, on [20). [C Con- 
sensogram denvea fom tne se- 
quences in (B) tnat indicates the 
percent occurrence o' each nucle- 
otide at each post Dm (D Sche- 
matic diagram liiusv at ng re two 
possible orientations cf the rume- 
odomain subsite re^atue tc re zinc 
finger subsite suggested t. the 
consensus. Mode i corresccncs tc 
the arrangement shcv.n in F.g. 1 . 
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Fig. 3. Comparison of the DNA 
binding specificity of ZFHD1, the 
Oct-1 POU domain (whic h contains 
a homeodomain and a POU-specif- 
ic domain), and the three Z'nc fin- 
gers from Zif268 (2 7). (A- The GST- 
ZFHD1 protein was titrated into 
DTnJA binding reactions containing 
the probe listed at tne top of each 
set of lanes Lanes 1,6. 11. and 1 6 
contain the protein at 9.8 x 10 " 
fvl. protein concentration was in- 
creased in three-:,' u increments m, 
subsequent lanes of each set. The 
position of the p r otem-DNA com: 
piex ts indicated cy an arrow. I'B 
The PA-Oct-1 PO'J fusion prote 1 ^ 
i >6: was titrated m*o pan^ie' DNA t ■ n 
at 2.1 ■ 10 ' f/ 
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wA\ [ / !an^?s 1.6, 1 1 .and 16 • 
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[-.n'ic DNAt;.nd:nu • t " 
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15V Thi- suggested thar the tir>r tour ba-e- of 
the con-ensu- sequence form the critical 
TAAT sulfite thar is recognized by the ho- 
meodomain (mode 1) and that ZFHD1 
bound as predicted in the model (Fig. 1). 

We compared ZFHD1, Oct-1, and 
Zif26S tor their abilities to distinguish 
among the Oct-1 site V-ATG- 
CAAATGA-3', the Zit76S site 5'- 
GCGTGGGCG-3', and the hybrid bind- 
ing sue 5 ' -T A ATG ATGGGCG - 3 ' . The 
fusion protein ZFHD1 preferred the opti- 
mal hybrid site to the octamer site by a 
factor of 240 (Fig. 3Ai compare lanes 1 to 
5 and 11 to 15) and did not bind to the Zit 
site (lanes 16 to 20). The POU domain of 
Oct-1 (Fig. 3B) bound to the octamer site 



with a diN-ociation constant ot l.S / 
10 !: M (lanes 1 to 5), preferring tin- site 
to the hybrid sequences by t actors ot 10 
(lanes 6 to 10) and 30 (lanes 1 1 to 1 5 ) and 
did not bind to the Zif site (lanes 16 to 20) 
(7). The three fingers of Zif26S (Fig. 3C) 
bound to the Zit site with* a dissociation 
constant of 3.3 X 10" IC M (lanes 16 to 
20) and did not bind to the other three 
sites (lanes 1 to 15). Thu> ZFHD1 bound 
tightly and specifically to the hybrid site 
and displayed DNA binding specificity 
that was clearly distinct from that ot either 
of the original proteins. 

We fused ZFHDl to a transcriptional 
activation domain and determined in trans- 
fection experiments whether the DNA 



Target; 
Fold 

induction: 



No insert ZFHD1 site Oct-1 site 
3.3 44.0 3.1 



Zf site 
3.5 



Fig. 4, Regulation of promoter act vity in vivo bv 
ZFH01. The 293 cells were cotransfected with 5 
of reporter vector, 1 0 of expression vector, 
and 5 of pCMV-hGH (gift of J. Parvm) used as 
an internal control (22). The reporter vectors con- 
tained two tandem copies of the ZF'HD i site ( TA- 
ATGATGGGCG). the Oct-1 site (ATGCAAATGA;, 
the Zr si:e (GCGTGGGCG), or no insert. The ex- 
pression vector encoded the ZFHD1 protein fused 
to the COOH- terminal 81 amino acids of VP16 
(plus signs), and the empty expression vector Ftc/ 
CMV was used as control (minus signs). The 
aTC -jnt of tuciferase activity obtained, normalized 
to hGH production, was set to 1 .0 far the cotrans- 
fecton of Rc/CMV with the no-insert reporter 

pGL2 -Promoter Bar graphs represent rhe average of three independent experiments. Actual values and 
standard deviation, reading from left to right, are: 1.00 - 0.05, 3.30 - 0.63; 0.96 - 0.08. 42.2 r 5.1; 
C.7G z 0.07, 2 36 * 0.34; and \27 n 0.10. 4.22 r 1.41. Fold induction refers to the amount of 
normalized act vity obtained with tne ZFHD1 -VP 16 expression construct divided by tnat obtained with 
Rc/CMV. 




B 



Fig. 5. Models of fusions of other DNA binding 
modules. (A) Putative fusion of zinc fingers with a 
basic -helix-loop-helir (bHLH) domain Finger 1 of 
Zit 263 is depicted in pink, finger 2 in yellow, and 
the VyC'D bHLH region (23) m red ana gray. The 
distance in angstroms between the COOH- termi- 
nus of f:nger 2 and the NH. -terminus cf the basic 
region of the bHLH domain is indicated. (B, A 
ro^atve zinc f nger-sterotd r ecepto r fuMon. Finger 
■ cf Zif268 is depicted ir fink, finger 2 in yellow, 
and tn-: giijcocortico'd receptor (24', in red and 
g-a, The d stance m angstroms between the COOH-termmus of finger 2 and the NH_. -terminus of the 
g'o"ocrirttcoid receptor is indicated T kju'es »v-k- generated with lns>ght II. 



finding protein could t 
expre^i. <n pla-mid enc 
to the COOH-termina 
the herpes simplex vim 



nc n. m m vivo. An 
vimg ZFHDl tu-ed 
M amino acids ot 
\T16 protein 



(ZFHDl ATI 6) was cotransfected into 293 
cells with reporter constructs containing 
the SY4C promoter and the hrerly luciterase 
gene. To determine whether the fusion pro- 
tein could specifically regulate gene expres- 
sion, we tested reporter constructs contain- 
ing two tandem copies of the ZFHD1 site 
5 ' -T A ATG ATGGGCG - 3 ' , the octamer 
site 5'-ATGCAAATGA-3\ or the Zif site 
5'-GCGTGGGCG-3' inserted upstream ot 
the SY4v promoter. When the reporter 
contained two copies of the ZFHD1 site, 
the ZFHDl -VP 16 protein stimulated the 
activity ot the promoter in a dose-depen- 
dent manner (3). Furthermore, the stimu- 
latory activity was spec it ic for the promoter 
containing the ZFHDl binding sites (Fig. 
4). At concentrations of protein that stim- 
ulated this promoter 44-fold, no stimulation 
above background was observed for promot- 
ers containing either the octamer or Zit 
sites. Thus, ZFHDl efficiently and specifi- 
cally recognized its target site in vivo 

Thi- structure-based strategy of fusing 
known DNA binding modules may provide 
a general method for designing transcrip- 
tion factors with novel DNA binding spec- 
ificities. Computer modeling suggests a 
number of other plausible arrangements for 
hybrid proteins. Figure 5 illustrates models 
of a zinc finger- basic -helix-loop-helix fu- 
sion protein (Fig. 5 A) and a :inc finger- 
steroid receptor fusion protein (Fig 5B) 
that should recognize hybrid binding sites. 
In each case, the modules can be fused by a 
short polypeptide linker without steric in- 
terference between the domains This strat- 
egy coulJ also be extended by variation in 
the length and sequence of the polypeptide 
linker^ an d then use of selection methods to 
optimize the binding affinity and specificity 
of the hybrid protein. 

The strategy of fusing modules can also be 
combined with strategies for changing the 
sequence specificity of individual modules. 
Several DNA binding domains are amenable 
to mutational strategies for changing se- 
quence specificity (9. Jv?), and zinc fingers 
may offer the most versatility ( / J ). Combin- 
ing structure-based design with mutational 
change- in specificity would greatly expand 
the rang'- of sequences that could be targeted 
by hybrid domains. 

The high affinity of ZFHDl for its opti- 
mal site and rhe fact that ZFHDl, Oct-1, 
and Zit-oS all clearly preferred different 
site- illustrate the success of the combina- 
torial approach. The specificity ot the hy- 
brid rratvcnprion factor depend- on the 
relatively Hitlerite .ittimty. bur high se- 
quence -pec if iv. 1 1 y . for the binding ot a mo- 
i']e module and on the cheDtc effect ( J -M 
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provided I v the covalent link or rm\l- 
ule>. The design criteria thu allowed the 
construction ot ZFHLH included the short 
length ot polypeptide linker that was re- 
quired to fuse the DNA bindine domains 
and the absence ot stenc interference be- 
tween these domains. 

Designed transcription tactors will he 
useful tor the targeted regulation ot specific 
cellular genes. The use ot particular DNA 
binding domains in a hybrid (or the addi- 
tion ot other domains) mav allow a protein 
to interact with other cellular tactors or to 
be modulated by a particular regulaton 
pathway. The structure-based design ot hy- 
brid transcription tactors should facilitate 
the development ot etticient and specific 
reagents tor biological research and gene 
therapy. 
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Footprint Analysis of Replicating Murine 
Leukemia Virus Reverse Transcriptase 

Birgitta M. Wohrl, Millie M. Georgiadis, Alice Telesnitsky, 
Wayne A. Hendrickson, Stuart F. J. Le Grice* 

Replication complexes that contained either murine leukemia virus reverse transcriptase 
(M LV RT ) or a variant reverse transcriptase without a nbonuclease (RNase) H domain (ARH 
MLV RT ) were visualized by enzymatic footprinting. Wild-type MLV RT protected template 
nucleotides -6 to -27, and primer nucleotides -1 to -26 of primers that had first been 
extended by one or four nucleotides. Although it catalyzed DNA synthesis, ARH MLV RT 
stably bound template-primer only under conditions of reduced ionic strength and pro- 
tected the duplex portion only as far as position -15. Despite altered hydrolysis profiles, 
both enzymes covered primarily the template-primer duplex, contradicting recent pre- 
dictions based on the structure of rat DNA polymerase p. 



Even though thev catalyze common reac- 
tions, retroviral rever>e transcriptases 
(RTs) .ire structurally diverge. Whereas 
human, equine, telme. and simian en- 
:\ine> share 1 heterodimeric orLianirarion 
of suhimr> encoded hy the RT p.ene ti-5), 
a Mil ■■unit > l t the avian vircoin.i-leukosis 
\ irus (A^l.Vl I'liivtiir retains the inte^raNe 
d' m un ■ >: th-' 2 i^-pol 1 ol\|Totein to) 
The is''!o.\l enoohe ot rh-.- murine leuke- 



mia virus (MLV) is a 75-kP monomer, 
whereas a 15c-kP homodimer 1^ proposed 
to analyze DNA synthesis (7). A structur- 
al motit common to RNa^e M ot hschcrich- 
hi to/, and MLV RT (u hehx III) is also 
ah^ent trom the human immunodeficiency 
virus (HIV) enzyme i^L Such observa- 
tion^ illustrate the import, uue ot compar- 
ative studies in understanding the evolu- 
tion ot these mult ituiu t ion il enzyme^. 
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